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Abstract. We examine statistical properties of the Calkin-Wilf 
tree and give number-theoretical applications. 



I. A MEAN- VALUE RELATED TO THE CALKIN-WlLF TREE 

The Calkin-Wilf tree is generated by the iteration 
a a a + b 

b a + b b 

starting from the root \\ the number -+r is called the left child of # 

° 1 ' a+b b 

and g ^ the right child; we also say that | is the mother of its children. 
Recently, Calkin & Wilf pQ have shown that this tree contains any 
positive rational number once and only once, each of which represented 
as a reduced fraction. The first iterations lead to 




Reading the tree line by line, the Calkin-Wilf enumeration of Q + starts 
with 

112132314352534 
1' 2' T' 3' 2' 3' 1' 4' 3' 5' 2' 5' 3' 4' T' "' 
As recently pointed out by Reznick [10], this sequence was already 
investigated by Stern [T2] in 1858. This sequence satisfies also the 
iteration 

xi = 1, x n+1 = l/(2[x n ] + 1 - x n ), 
l 
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where [x] denotes the largest integer < x; this observation is due to 
Newman (cf. [8]), answering a question of D.E. Knuth, resp. Van- 
dervelde & Zagier (cf. [TT]). 

The Calkin-Wilf enumeration of the positive rationals has many in- 
teresting features. For instance, it encodes the hyperbinary represen- 
tations of all positive integers (see [I]). Furthermore, it can be used 
as model for the game Euclid first formulated by Cole & Davie [2] ; see 
Hofmann, Schuster & Steuding [5] . In this short note we are concerned 
with statistical properties of the Calkin-Wilf tree. 

We write the nth generation of the Calkin-Wilf tree as CW^ = 
{x^}j, where the x^} are the elements ordered according to their ap- 
pearance m the nth line of the Calkin-Wilf tree. So Q + = U^=i CWH 
Obviously, CW^ consists of 2 n ~ l elements. Denote by S(n) the sum 
of all elements of the nth generation of the Calkin-Wilf tree, 

2 n-l 

E(n) = ]Taf . 
i=i 

Our first result gives the mean- value of the elements of the rath gener- 
ation of the Calkin-Wilf tree: 

Theorem 1. For any n£N, 

S(n) = 3-2 n - 2 --. 

This result may be interpreted as follows. We observe that x± = - and 
x^-i = f f° r an 71 e N, and thus CW^ is supported on an unbounded 
set as n — > oo. However, the average value of the 2 n ~ l elements of 
the nth generation CW^ is approximately |, which is, surprisingly, 
a finite number. This has a simple explanation: in some sense, small 
values are taken in earlier generations than large values. For instance, 
in each generation CW^ takes as many values form the interval (0, 1) 
as from (l,oo). This result was also recently proved by Reznick [TO] ; 
his proof differs slightly from our argument o 

1 Thc problem of determining the average value of the Calkin-Wilf tree was 
posed by the second named author as a problem in the problem session of the IV 
International conference on analytic and probabilistic number theory in Palanga 
2006; an independent solution was given by Eduard Wirsing. 
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Proof by induction on n. The statement of the theorem is correct for 
n — 1 and n = 2. Now suppose that n > 3. In order to prove the 
statement for n we first observe a symmetry in the Calkin- Wilf tree 
with respect to its middle: for n > 2, 

(n)_a (n) _6. n s 

this is easily proved by another induction on n (and we leave its simple 
verification to the reader). Further, we note that Xj < 1 if and only 
if j is odd; here equality holds if and only if n = 1. 

Now we start to evaluate E(n). For this purpose we compute 

» i J n ) f nr . A _ 1 O on-2 



(n) I ^ + forj = l,2,...,2^-l, 

^ ' \ xf>+x£) forj = 2™- 2 , 

and add these values over j = 1, 2, . . . , 2 n ~ 2 . Clearly, S(n) = Y^j=i Vj^- 
First, assume that j is odd. Then both, x ™ and rr 2 ^_i . are strictly 
less than 1. In view of ([1]) the mothers of x^ and x^_ 1 _. are of the 
form | and -, respectively. Hence, 

j +X ^-3 a + b + a + b 



,(») 



and thus we find = 1 in this case. 

Next, we consider the case that j is even. Then both, xT 1 and x" n 
are strictly greater than 1. If the mothers of x^ and a^n-i- • are °f the 
form | and respectively, then 

(n) o + b a (n ) a' 

x) = —g- = 1 + - and z^.!^. = 1 + ^7- 

Hence, we find for their sum 

(n) (n) _ „ a <^ 
x i + :r 2»- 1 -j ~ b V 

and so = 2 + y^~ l \ where y^~^ is either the sum of two elements 

x^ and ^2™-2-fc or ^ ne sum °f ^"-^ anc ^ x 2»-2 ■ 

It remains to combine both evaluations. Since both cases appear 
equally often, namely each 2 n ~ 3 times, we obtain the recurrence formula 

E(n) = E(n- !) + (! + 2)- 2 n ~ 3 , 
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being valid for n > 3. This implies the assertion of the theorem. ■ 



2. An application to finite continued fractions 

Theorem [1] has a nice number-theoretical interpretation. It is well- 
known that each positive rational number x has a representation as a 
finite (regular) continued fraction 

1 

x = ao H 

ai + . 

''+ — hn- 

am 

with Oo G NU {0} and aj G N for some m G N U {0}. In order to have 
a unique representation, we assume that a m > 2 if m G N. We shall 
use the standard notation x = [ao, a±, . . . , a m }. Continued fractions are 
of special interest in the theory of diophantine approximation. 

As Bird, Gibbons & Lester [3] showed, the nth generation of the 
Calkin-Wilf tree consists exactly of those rationals having a continued 
fraction expansion [ao, ai, . . . , a m ] for which the sum of the partial quo- 
tients aj is constant n, the continued fractions of even length in the 
left subtree, and the continued fractions with odd length in the right 
subtree. Thus Theorem [T] yields 

Corollary 2. For any n£N, 

2 1 -" Yl [ao,a u ...,a m ] = ^-2- n . 

ao+a\ + ...+a m =n 

One can use the approach via continued fractions to locate any positive 
rational in the tree. This observation is due to Bird, Gibbons & Lester 
[3] (actually, their reasoning is based on Graham, Knuth & Patasnik [1] 
who gave such a description for the related Stern-Brocot tree). Given 
a reduced fraction x in the Calkin-Wilf tree with continued fraction 
expansion 

x = [ao , Qi, . . . , a m —2, a m —i, a m \ , 
we associate the path 

|_a m -l R a m _ lL a m - 2 . . . |_a 1R ao j f m j g odd; &nd 
R a m -l|_a m _ 1R a m _ 2 . . . RB1 |_a j f m j g even; 



STATISTICAL PROPERTIES OF THE CALKIN-WILF TREE 



5 



note that a m - 1 > 1 for m G N. The notation R a with a G N U {0} 
means: a steps to the right, whereas L 6 with b G N U {0} stands for b 
steps to the left. Then, starting from the root y and following this path 
from left to right, we end up with the element x. This follows easily 
from the iteration with which the tree was build; notice that this claim 
is essentially already contained in Lehmer [7] (this was also observed 
by Reznick [ID]). 

Corollary 3. Given any non-empty interval (oe,/3) in 1R + , and any 
finite path in the Calkin- Wilf tree, there exists a continuation of this 
path which contains a rational number from the interval (a,/3). 

Proof. We expand a and (3 into continued fractions, a = [a ,ai, . . .] 
and P = [b ,bi, . . .], say. Let k be the least index such that ^ b^. 
According to the parity of k we have < bk (if k is even) or at > bk (if 
k is odd). Without loss of generality we may assume that \bk — ajt| > 2 
(since otherwise we may consider a subinterval of (a,/3)). Moreover 
we may suppose that the path in question is starting from the root 
and is given in the form L Cm_1 R Cm " 1 L Cm_2 ■ ■ • |_ Cl R c ° (the other case may 
be treated analogously). Then we construct a rational number x by 
assigning the finite continued fraction 

x = [a , O-x, . . . , Ofc_i, Xfc, Co, Ci, . . . , C m _2, C m _i, C m J, 

where Xk '■— min{afc, bk} + 1 and Xk+i denotes the string 1 if k is odd, 
resp. 1,1 if A; is even. Since bj = aj for < j < k and 

min{a fc ,6 fc } < x k < max{a fc ,6 fc }, 

it follows that a < x < (3. Since the length of the continued fraction 
expansion has the same parity as m (thanks to the definition of Xk+i), 
the element x can be reached by the path L Cm ~ 1 R Cm - 1 L Cm - 2 ■ ■ • L Cl R c °. 
This proves the corollary. ■ 

3. A RANDOM WALK ON THE CALKIN-WlLF TREE 

Starting with X\ = y, we define a sequence of random variables by 
the following iteration: if X n = |, then X n+ \ = with probability 
| and X n+ i = with probability |. The sequence {X n } may be 
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regarded as a random walk on the Calkin-Wilf tree where n is a discrete 
time parameter. 

Theorem 4. Let (a, ($) be any non-empty interval in IR + . Then, with 
probabilty 1, the random walk {X n } visits the interval (a, (3), i.e., with 
probability 1, there exists m G N such that x m G (ct,/3). 

Proof. The interval (a, (3) contains a non-empty subinterval [A, B] 
such that for any ( G [A, B] the initial partial quotients c ,c 1 , . . . ,c m 
are identical: ( = [c , Ci, . . . , c m , . . .]. Hence, with the interval [A, 5] we 
may associate a path pattern |_ Cm_1 R Cm - 1 L Cm - 2 • • • L Cl R c ° in the Calkin- 
Wilf tree such that any path in the tree starting from the root and 
ending with |_ Cm_1 R Cm - 1 L Cm - 2 • • • |_ Cl R c ° points to an element in [A, B}. 
Since the probability is \ for both | i— > ^ and | i— > each pattern 
of fixed length m appears with the same probability and so we may 
restrict on the path pattern R k . 

In the case k — 1 we find in each generation exactly one which ends 
with R but does not contain any R before (actually, this is L a ~ 1 R in 
generation n). Adding up all probabilities for these paths, we get 



2 

and so, with probability 1, the random walk X n will go to the right 
child for some n. Now assume that the statement is true for k. We shall 
show that then it is also true for k + 1. For each path of the form XR k , 
where X is any combination of powers of L and R, there are two paths 
XR k L and XR k+1 , so by induction the probability that the random 
walk eventually follows the path R k+1 is at least \. However, for each 
path XR k one also has to consider the subtrees starting from XR k L d 
for d = 1,2,..., each of which containing paths which end R k+1 . By 
self-similarity, the probability that the random walk eventually follows 
the path R k+1 is 

i 00 



This proves the theorem. 
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4. Statistical properties of the Calkin-Wilf tree 

In view of Corollary [2] it is interesting to have a better understanding 
of the statistics of the Calkin-Wilf tree. The following theorem gives 
the limit distribution function in explicit form. 

Theorem 5. Let F n (x) denote the distribution function of the n—th 
generation, i.e., 

F n (x) = 2 1 ""#{j : xf ] < x}. 

Then uniformly F n (x) — > F(x), where 

F{[a , a u o 2 , a 3 , ...]) = 1 - 2~ a ° + 2" (ao+ai) - 2-( ao+ai+a2 ) + ... 

(for rational numbers x = [ao, Oi, . . .] this series terminates at the last 
non-zero partial quotient of the continued fraction). Thus, F(0) = 0, 
F(oo) = 1, and F(x) is a monotonically increasing function. Moreover, 
F(x) is continuous and singular, i.e., F'(x) = almost everywhere. 

Proof. Let x > 1. One half of the fractions in the n + 1st generation 

do not exceed 1, and hence also do not exceed x. Further, 

a + b a 
— : — < x -<=>- — < x — 1. 
b b 

Hence, 

2F n+1 {x) = F n {x - 1) + 1, n > 1. 

Now assume < x < 1. Then 

a ax 
< x <^=^ - < 



a + b b 1 — x 

Therefore, 

2F n+1 {x) = F n (-^- 
V 1 — X/ 

The distribution function F, defined in the formulation of the theorem, 
satisfies the functional equation 



2F(x) 



F(x-1) + 1 if x > 1, 
F(^-) if 0<x<l. 

V 1 — x ' 



For instance, the second identity is equivalent to 2F(^-) = F(t) for 
all positive t. If t = [b , fei, ...], then ^ = [0, 1, b , & 1( ..] for t > 1, and 
= [0, bi + 1, b 2 , ...] for t < 1, and the statement follows immediately. 



GIEDRIUS ALKAUSKAS, JORN STEUDING 



Now define 5 n (x) = F(x) — F n (x). In order to prove the first assertion 
of the theorem, the uniform convergence F n — > F, it is sufficient to show 
that 

sup|5 n (x)| <2~ n . (2) 

x>0 

It is easy to see that the assertion is true for n = 1. Now suppose 
the estimate is true for n. In view of the functional equation for both 
F n (x) and F(x), we have 



2S n+1 (x) = 5„( 



x 



1 — X 



for < x < 1, which gives sup 0<;c<1 |5 n+ i(x)| < 2 n 1 . 
have 



Moreover, we 



25 n+1 (x) = S n (x-l) 

for x > 1, which yields the same bound for S n (x) in the range x > 1. 
This proves (J2j). 

Clearly, F, as a distribution function, is monotonic; obviously, it is 
also continuous. It remains to prove that F(x) is singular. Given an 
irrational number a = [a , a±, a 2 , ...], we consider the sequence 

obviously, a n is the real number which is defined by the continued 
fraction expansion of a, where the nth partial quotient a n is replaced 
by a n + 1. Depending on the parity of n, a n is less than or greater 
than a. Thus, any real number y, which is sufficiently close to a, is 
contained between two terms of the sequence, ai and aL+2 say. Then 



F(y) - F(a) 



y-a 



< 



F(a L ) - F(a) 



«L+2 



a 



From the explicit form of F we deduce 



\F(a L ) - F(a)\ < -2^ a ° +a ' + - +aL \ 
On the other hand, 

|«l+2 -Oi\ > ([ax, a 2 , a L+2 + 1, ...] - [ax, a 2 , a L+2 , ...])(a + 1)~ 2 



> (a + l)(ai + l)...(a L+2 + r 
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by induction. Thus, 



y — a 

The theorem of Khinchin ([9], p. 86, implies that nr=i( a « + 1) 1 ^™ tends 
to a fixed constant limit almost everywhere. On the other hand, the 
same reasoning shows that - Y17=i fln tends to infinity for almost all x. 
Thus, almost everywhere the limit 

\im(F(y)-F(a))(y-a)- 1 
exists and is equal 0. This finishes the proof of the theorem. ■ 

By the same argument as for the singular behaviour of F we can 
show that F'( v ^ 2 +1 ) = oo. Actually, the terms of CW^ are densely 
concentrated around numbers with F'(x) = oo and scarcely around 
those where F'(x) = 0. The value of F(x) is rational iff x is either 
rational or quadratic irrationality, e.g. 

Hl) = l, F ^) = ^ F((V5 + l)/2) = 2 -. 

This follows immediately from Lagrange's theorem which characterizes 
the quadratic irrationals by their eventually periodic continued fraction 
expansion. For Euler's number e = [2, 1, 2n, 1] we find that F(e) can 
be expressed in terms of special values of Jacobi theta functions. 

5. Characteristics of the distribution function 



In view of Corollary [2J the mean of the distribution function F is 
|. Since F has a tail of exponential decay, more precisely 1 — F(x) = 
0(2~ x ), it follows that all moments exist. For k £ No, the kth moment 
is defined by 

oo 


In order to give an asymptotic formula for let 

oo 



o 
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We will see that the generating function of m k has some interesting 
properties. Let u>(x) be a continuous function of at most power growth: 
uj{x) <C x T as x — > oo. By the functional equation for F we find 
F( x + n) = 1- 2- n + 2~ n F(x), x>0. Hence 

/oo „ 
uj{x) dF(x) = / u(x + n) dF(x + n) 

"=° 

1 



n «=0 



these integrals exist in view of our assumptions and the fact that F(x) 
has a tail of exponential decay. Let x = ^ for t > 0. Since ^(^x) = 
this change of variables gives 



/ dF(x) = / dF{t) 
n=0 

(All changes of order of summation and integration are justified by the 
condition we put on u>(x)). Now let u(x) = x L for some I 6 No and 
define 



00 „ 
n s 



?1=0 



Then 

00 



/^> = /E^)'(tK< d ^ 



i=0 

whence the relation 

L 



M L = Y,^[ L -)b L - % (3) 



i=0 



for L E No- The generating function of the sequence of the b s is given 
by 



w = E^ L -EE^ = E 



L=0 L=0 n=0 n=0 
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Denote by M(t) and m(t) the corresponding generating functions of 
the coefficients and m&, respectively. Then we can rewrite ([3]) as 

M(t) = y ^t L = - J- y ^ = 

L=0 L=0 

The function m(i) is entire, and M(t) has a positive radius of con- 
vergence. This already allows us to find approximate values of the 
moments Ml- 



Theorem 6. For L G N , 

M ^ m(log2)f 1 V ' 



L! + O e (((4vr 2 + (log2) 1 / 2 - eY^D- 



2 log 2 Vlog2 

V 2 log 2 Vlog2/ v V 
Proof. By Cauchy's formula, for any sufficiently small r, 

Mr = / — \± dz. 



M L - -Res, =log2 y {2 _ e , )zL+1 J - ^ _/ ^liZTT 



2vri 7 k | =r 2 L+1 

Changing the path of integration, we get by the calculus of residues 

m(z) ^ L\ f m(z) dz 

l\z\=R 

where R satisfies log 2 < R < \ log 2 + 2iri\ (which means that there is 
exactly one simple pole of the integrand located in the interior of the 
circle \z\ = R). It is easily seen that the residue coincides with the 
main term in the formula of the lemma; the error term follows from 
estimating the integral. ■ 

We obtain the inverse to the linear equations (J3]): 

m L = Mi-^M, 

for L 6 No. Since b{t){2 — e*) = 1, the coefficients bi can be calculated 
recursively 

L-l 



Thus, b = 1, bx = 1, b 2 = 3, 6 3 = 13, 6 4 = 75, 6 5 = 541. 
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We proceed with a property of the function m(t) which reflects the 
symmetry of the distribution function: F(y) + F(l/y) = 1. Unfortu- 
nately, this property is still insufficient for determining the coefficients 
rriL- As a matter of fact, 

oo oo 

m L = I (^—) L &F{x) = - / (J^-) L dF(l/x) 



.x + lJ v ' J \l/x + 1 



oo 

1 \ L 

dF{x) 



x + 1 







Since 



X \L fX+l-l\L ^( rjL V-l) r - / 1 ^ 



x + 1/ V x+1 / / — ' \s J \x + 1 

s=0 N ' 



this gives 



L 'L 



s=0 

for L > 0. For example, mi = mo — mi, which gives mi = | (since 
mo = 1), and thus Mi = | (see Theorem [1]). For the other coefficients 
we only get linear relations. Thus, 2m 3 = — | + 3m2. In terms of m(£) 
the recursion formula above yields the identity 

m(t) = m(—t)e t . 

We conclude this chapter with the result, which uniquely determines 
the function m(t) (along with the condition m(0) = 1). 

Theorem 7. The function m(s) satisfies the integral equation 

oo 

m(-s) = (2e s - 1) J m\-t)J (2Vsi) dt, s e R+, 
o 

where Jq(*) stands for the Bessel function: 

7T 

Jq{z) = — cos(zsina;) dx 
vr I 



o 
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The proof of this theorem and the solution of this integral equation, 
and thus the explicit description of the moments will be given in a 
subsequent paper. 

6. p-ADIC DISTRIBUTION 

In the previous sections, we were interested in the distribution of the 
nth generation of the tree CW in the field of real numbers. Since the 
set of non-equivalent valuations of Q contains a valuation, associated 
with any prime number p, it is natural to consider the distribution of 
the set of each generation in the field of p— adic numbers Q p . In this 
case we have an ultrametric inequality, which implies that two circles 
are either co-centric, or do not intersect. We define 

F n (z, u) = 2-™ +1 #{^ G CWW : ord p (^ -z)>u}, z G Q p , v G Z. 

(When p is fixed, the subscript p in F n is omitted). Note that in 
order to calculate F n (z, v) we can confine to the case ord p (z) < v\ 
otherwise ord p (| — z) > v ord p (|) > v. We shall calculate the limit 
distribution [i p {z,v) = lim^oo F n (z, v), and also some characteristics 
of it, e.g. the zeta function 

Z p (s) = J \u\ s dfx p , s G C, z G Q p , 

where | * | stands for the p— adic valuation. 

To illustrate how the method works, we will calculate the value of F n 
in two special cases. Let p — 2, and let E(n) be the number of rational 
numbers in the nth generation with one of a or b being even, and let 
0(n) be the corresponding number fractions with both a and b odd. 
Then E(n) +0{n) = 2 n ~ 1 . Since | in the nth generation generates ^ 
and ^ in the (n + l)st generation, each fraction | with one of the a, b 
even will generate one fraction with both numerator and denominator 
odd. If both a, b are odd, then their two offsprings will not be of this 
kind. Therefore, 0(n+l) = E(n). Similarly, E(n + 1) = E(n) + 20(n). 
This gives the recurrence E(n + 1) = E(n) + 2E(n — 1), n > 2, and 
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this implies 

9 n + 2f— ~\) n 2 n ~ 1 4- 2(— 1 V rt_1 2 

E(n) = 2 +2 3 ( 1J , 0(n) = 2 +2 3 ( lj , ^(0,0) = !- 

(For the last equality note that | and - simultaneously belong to 
OV^, and so the number of fractions with ord2(*) > is E(n)/2). 
We will generalize this example to odd prime p > 3. Let Lj(n) be the 
part of the fractions in the nth generations such that ab^ 1 = i mod p 
for < 2 < p — 1 or z = oo (that is, b = mod p). Thus, 

iSFpUoo 

in other words, Li(n) = F n (i,l). For our later investigations we need 
a result from the theory of finite Markov chains. 

Lemma 1. Let A be a matrix of a finite Markov chain with s stages. 
That is, aij > 0, and X]j=i a « ,j = 1 f or a ^ Suppose that A is 
irreducible (for all pairs and some m, the entry a ™j of the matrix 

A m is strictly positive), acyclic and recurrent (this is satisfied, if all 
entries of A m are strictly positive for some m). Then the eigenvalue 1 
is simple, if X is another eigenvalue, then |A| < 1, and A m , as m — » oo ; 
tends to the matrix B ; with entries b^j = Tij, where (tti, 7r s ) is a 
unique left eigenvector with eigenvalue 1, such that Ylj=i 71 j = 1- 

A proof of this lemma can be found in [6], Section 3.1., Theorem 1.3. 

Theorem 8. fJ, p (z, 1) = for z G Z p . 

Proof. Similarly as in the above example, a fraction | from the nth 
generation generates ^ and 2 ^ in the (n + l)st generation, and it is 
routine to check that 

Li(n + 1) = ]-L_^(n) + -Li_i(n) for iGF p U{oo}, (4) 

(Here we make a natural convention for and i — 1, if z = 1 or 
oo). In this equation, it can happen that z — 1 = mod p; thus, 
(2i — l) 2 = —3 mod p. The recurrence for this particular i is to be 
understood in the obvious way, Li(n + 1) = Lj_i(n). Therefore, if we 
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denote the vector-column (L 0O (n), L (n), L p _i(n)) T by v n , and if A 
is a matrix of the system (BJ, then v n+ i = ^4v n , and hence 

where vi = (0, 0, 1, 0, 0) T . In any particular case, this allows us two 
find the values of Lj explicitly. For example, if p = 7, the characteristic 
polynomial is 

f(x) = —{x - l)(2x - l)(2x 2 + l)(4x 4 + 2x 3 + 2x + 1). 
16 

The list of roots is 



1 . % -1 - VT7 , V 1 + VTT 

«i = 1> « = " 3 > 4 = ±_ yf ' a5 > 6 ' 7 ' 8 = § ± — — ' 

(with respect to the two values for the root y/Tf), the matrix is diago- 
nalisible, and the Jordan normal form gives the expression 

8 

8=1 

Note that the elements in each row of the (p + 1) x (p + 1) matrix ^4 are 
non- negative and sum up to 1, and thus, we have a matrix of a finite 
Markov chain. We need to check that it is acyclic. Let r{i) = % — 1, 
and oil) = y+ for ieF p U {oo}. The entry ajj of ^4 m is 

(m) _ \ ^ 

&■ ■ / di j-. ' &i, j • ... CI 



i,j / j "VI U '«1,12 ••• "'im-lj 1 

ii,...,i m _i 



Therefore, we need to check that for some fixed m, the composition of 
m a's or r's leads from any i to any j. One checks directly that for 
any positive k, and i, j G ¥ p , 

tP- 1 -! o a o r k o (j o r l_1 (i) = j, 
r rl_J o a o ^(oo) = j, 
r k o a o r l_1 (i) = oo; 

(for i = 0, we write r -1 for r p_1 ). For each pair choose k in 

order the amount of compositions used to be equal (say, to m). Then 
obviously all entries of A m are positive, ant this matrix satisfies the 
conditions of lemma. Since all columns also sum up to 1, (tti, 7T p +i), 
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TTj = -j^-, 1 < j ' < p + 1, is the needed eigenvector. This proves the 
theorem. ■ 



Theorem 9. Let v G Z and z G Q p , and ordp(z) < v (or z = 0). 
Then, if z is p—adic integer, 

y.{z,v) 



If z is not integer, ord p (z) = —A < ; 
For z = 0, —v < 0, we have 

1 



MO, 



pU+l _|_ pU 

This theorem allows the computation of the associated zeta-function: 
Corollary 10. For s in the strip — 1 < SRs < 1, 

b-i) 2 



Z p (s) = / \u\ s d/j, t 
JueOr, 



ip-p s )(p-p s )' 



and Z p (s) = Z p ( 



The proof is straightforward. It should be noted that this expression 
encodes all the values of /^(O, v) for v G Z. 

Proof of Theorem For shortness, when p is fixed, denote ord p (*) 
by v(*). As before, we want a recurrence relation among the numbers 
F n (i,K), i G Q+. For each integral k, we can confine to the case 
i < p K . If i — 0, we only consider k > and call these pairs (i, k) 
"admissible". We also include G n (0, —k) for k > 1, where these values 
are defined in the same manner as F n , only inversing the inequality, 
considering | G CW^ n \ such that < — k; the ratio of fractions 

in the nth generation outside this circle. As before, a fraction | in 
the nth generation generates the fractions and in the (n + l)st 
generation. Let r(i, n) = ((i — 1) mod p K , k). Then for all admissible 
pairs (i, k), % ^ 0, the pair r(i, k) is also admissible, and 
,a + b a 

V{ : t) = K <£4> V{- U — 1)) = ft. 
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Second, if ^ — i + p K u, i ^ 1, u G Z p , and (i, k) is admissible, then 



a+6 

a i p K w 



6 1 — % (1 — i)(l — % — p K u) 

Since = v(i) — v(l — i), this is unless % is an integer, equals 

to v(i) if the latter is > and equals to — v(l — i) if v(l — i) > 0. 
Further, this difference has valuation > kq = k, if i G Z, i ^ 1 mod p, 
valuation > k = « — 2t>(l — i), if i G Z, i = 1 mod p, and valuation 

> k = k — 2t>(?) if « is not integer. In all three cases, easy to check, 
that, if we define i = j-: mod p K °, the pair a(i, k) = dcf (io, kq) is 
admissible. For the converse, let | = i + p K °u, u G Z p . Then 

a «o P K ° 

a + 6 l + i ~ (l + i + p^u){l + i ) ' 

If i = is a p— adic integer, i ^ 1 mod p, this has a valuation 

> k = K ; if i is a p— adic integer, i = l(p), this has valuation 

> k = k — 2i>(i ) = /to + 2f (1 — i); 
if i is not a p— adic integer, this has valuation 

> K = Kq — 2t>(l + i ) = Kq + 2w(i). 



Thus, 



a 

— i) > k f (- — -i ) > 



a + 6 6 
Let i = l. If ^ = 1 + p K M, then k > 0, u £ Z p , and we obtain 

| = — 1 — f (|) < — k. Converse is also true. Finally, for k > 1, 

.a + 6. .a. 

and 

— ^—r) < -K <^> + 1) > K. 

v a + b' ~ v 6 7 - 

Therefore, we have the recurrence relations: 

F n+ i(i, k) = ^F n (r(i, k)) + ^F n (a(i, «)), if (i, /c) is admissible, 
F n+1 (l, k) = lF n (0, k) + ±G n (0, -«), k > 1, (5) 
G„+i(0, -«) = ±G n (0, -«) + |F n (-l, «), k > 1. 

Thus, we have an infinite matrix «4, which is a change matrix for the 
Markov chain. If v„ is an infinite vector-column of F^s and G' n s, then 
v n+ i = Av n) and, as before, v n = „4 n_1 Vi. It is direct to check that 
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each column also contains exactly two nonzero entries |, or one entry, 
equal to 1. In terms of Markov chains, we need to determine the classes 
of orbits. Then in proper rearranging, the matrix A looks like 



/Pi 


.. 


. 


...\ 





P 2 


. 







. 


. p s 





V : 




. 


•••/ 



where P s are finite Markov matrices. Thus, we claim that the length of 
each orbit is finite, every orbit has a representative G*(0, —k), k > 1, 
the length of it is p K + p K ~ 1 , and the matrix is recurrent (that is, every 
two positions communicate). In fact, from the system above and form 
the expression of the maps r(i,K) and a(i, «), the direct check shows 
that the complete list of the orbit of G*(0, —k) consists of (and each 
pair of states are communicating): 

G,(0,-k), 
F,(z» (« = 0,l,2,...,p K -l), 
F*(p~ x u, K-2X) (A = 1, 2, k - 1, u G N, u ^ mod p, u < p K ~ x ). 
In total, we have 

K-l 

i+ p k + Y,(p K ~ x - p^ 1 ) =p K + 

members in the orbit. Thus, each P K in the matrix above is a finite 
dimensional i K x l K matrix, where l K = p K +p K ~ 1 . For k — 1, the 
matrix Pi is exactly the matrix of the system (jlj). As noted above, the 
vector column (1, 1, 1) T is the left eigen-vector. As in the previous 
theorem, it is straightforward to check that this matrix is irreducible 
and acyclic (that is, the entries of P" are strictly positive for sufficiently 
large n). In fact, since by our observation, each two members in the 
orbit communicate, and since we have a move G*(0, —k) — ► G*(0, —k), 
the proof of the last statement is immediate: there exists n such that 
any position is reachable from another in exactly n moves, and this 
can be achieved at the expense of the move just described. Therefore, 
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all entries of P™ are strictly positive. Thus, the claim of the theorem 
follows from the lemma above. ■ 
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